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Abstract. An important problem in quantum information theory is the mathematical character- 
(-H ' ization of the phenomenon of quantum catalysis: when can the surrounding entanglement be used 

' to perform transformations of a jointly held quantum state under LOCC (local operations and clas- 

sical communication) ? Mathematically, the question amounts to describe, for a fixed vector y, the 
set T{y) of vectors x such that we have x ® z ^ y ® z for some z, where -< denotes the standard 
majorization relation. 

Our main result is that the closure of T{y) in the li norm can be fully described by inequalities 
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on the norms: WvWp for all p ^ 1. This is a first step towards a complete description of 

T{y) itself. It can also be seen as a £p-norm analogue of Ky Fan dominance theorem about unitarily 
invariant norms. The proofs exploits links with another quantum phenomenon: the possibiliy of 
multiple-copy transformations (x®" -< y®" for given n). The main new tool is a variant of Cramer's 
theorem on large deviations for sums of i.i.d. random variables. 
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The increasing interest that quantum entanglement has received in the past decade is due, in 
' part, to its use as a resource in quantum information processing. We investigate the problem of 

, entanglement transformation: under which conditions can an entangled state |(/)) be transformed into 

another entangled state ? We restrict ourselves to LOCC protocols: Alice and Bob share \4>) 
and have at their disposal only local operations (such as unitaries Ua ® Is for Alice) and classical 
communication. Nielsen showed in [13] that such a transformation is possible if and only if ^ A^, 
where is the majorization relation and A^, A^ are the Schmidt coefficients vectors of and \ip) 
^ • respectively. Practically in the same time, Jonathan and Plenio [8] discovered a striking phenomenon: 

5^ , entanglement can help LOCC communication, without even being consumed. Precisely, they have 

found states {(p) and 1-0) such that {(p) cannot be transformed into {tp), but, with the help of a catalyst 
state Ix), the transformation \(f>) ® |x} — > IV') ® lx) is possible. When such a catalyst exists, we say 
that the state \(j)) is trumped by lip) and we write A^ ~<t A^. We say then that \p) can be transformed 
into IV') by entanglement-assisted LOCC or ELOCC. It turns out that the trumping relation is much 
more complicated that the majorization relation; one can easily check on two given states \(p) and 
\ip) whether A^ -< A^ is satisfifted or not, but there is no direct way to determine if A^ -<t A^. 
Later, Bandyopadhyay et al. [Ij discovered that a similar situation occurs when trying to transform 
by LOCC multiple copies of into \ip). It may happen that the transformation |-0) is not 

possible, but when considering n copies, one can transform \p)^^ into |i/))®". The phenomenon of 
multiple simultaneous LOCC transformations, or MLOCC, has been intensively studied in the last 
years and many similarities with ELOCC have been found |6l[7j- 

In this note, we make some progress towards a complete characterization of both ELOCC and 
MLOCC. We show that a set of inequalities involving ip norms (see the remark on Conjecture [T] at 
the end of the paper) is equivalent to the fact that \(j>) can be approached by a sequence of states |0„) 
which are MLOCC/ELOCC-dominated by \Tp). An important point is that we allow the dimension 
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of to exceed the dimension of Our proof uses probabilistic tools; we introduce probability 
measures associated to and ji/') and we use large deviation techniques to show the desired result. 

Interestingly, the result can be reversed to give a characterization of (.p norms that is similar to 
the Ky Fan characterization of unitarily invariant norms. We refer the interested reader to Section [3l 
The rest of the paper is organized as follows: in Section [2] we introduce the notation and the general 
framework of entanglement transformation of bipartite states. We also state our main result, Theorem 
[TJ The theorem is proved in Section [H Conclusions and some directions for further study are sketched 
in Section [H The appendix at the end of the paper contains basic results from large deviation theory 
needed in the proof of the main theorem. 

Acknowledgement: we thank the referees for several helpful remarks that improved the presen- 
tation of the paper. 



2. Notation and statement of the results 

For e N*, let Pd be the set of c?-dimensional probability vectors : Pd = {x G R"^ s.t. Xi > 
Q.Y^Xi — 1}. If X S Pd, we write x^ for the decreasing rearrangement of x, i.e. the vector € Pd 
such that X and x^ have the same coordinates up to permutation, and x\ ^ We shall also write 

^^max for x'[ and x^^^ for the smallest nonzero coordinate of x. 

There is an operation on probability vectors that is fundamental in what follows: the tensor product 
®. If a; = (xi, . . . ,Xd) & Pd and x' = {x'^, . . . ,x'j^,) G Pd' , the tensor product x ® x' is the vector 
{xix'j)ij G Pdd'', the way we order the coordinates of x x' is immaterial for our purposes. We also 
define the direct sum x (B x' as the concatenated vector {xi, . . . ,Xd,x[, . . . ,x'^,) e R'*"'''^ . 

It a; e Pd satisfies Xd = 0, it will be useful to identify x with the truncated vector {xi, . . . ,Xd-i) G 
Pd-i- This identification induces a canonical inclusion Pd-i C Pd- Thus, every vector x ^ Pd can be 
thought of as a vector of Pd' for all d' ^ by appending d! — d null elements to x. We consider thus 
the set of all probability vectors P<oo = Ud>o ^d- In other words, P<oo is the set of finitely supported 
probability vectors. 

Let us now introduce the classical majorization relation [UlEj. \i x,y G R'* we define the subma- 
jorization relation as follows 

k k 

x<^y iff. Vfc G {!,... d}, < ^y|, 

i=l i=l 

and the majorization relation -< as 

d d k k 

x<y iff. ^Xi^'^y, and Vfc G {1, . . . d - 1}, ^ ^ ^ . 

i—l i—1 i—1 i—1 

We usually work with probability vectors, for which both relations coincide. However, it will be 
useful in the proof to work with deficient vectors (of total mass less than 1) and to use submajorization. 
We write Sd{y) for the set of vectors x in Pd which are majorized by y. It is well-known that Sd{y) is 
a compact convex set whose extreme points are the vectors obtained by permuting the coordinates of 
y; many other characterizations of Sd{y) are known |14[ [3]. This relation behaves well with respect to 
direct sums and tensor products: x ^ y implies x (B z ^ y O z and x (E) z ^ y z for any z G P<oo- The 
majorization relation has been shown to have a very important role in quantum information. Nielsen 
has proved [13] that a state |0) belonging to Alice and Bob can be transformed into the state IV') by 
using local operations and classical communication (LOCC) if and only if 

-< A^, 
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where (respectively A^) is the vector of eigenvalues of the density matrix for Alice's system when 
the joint system is in the state \(f)) (respectively Not long after Nielsen's theorem, Jonathan 

and Plenio have discovered a very intriguing phenomenon: there exist states |0) and such that 
the transformation \(j)) — > ji/i) is impossible by LOCC, but, with the aid of a catalyst state |x), the 
transformation \4>) ® \x) IV') ® \x) becomes possible; we say that \4>) can be transformed into |'0) by 
Entanglement-assisted LOCC or ELOCC. This result has motivated a more complex relation between 
probability vectors: if a;,y G Pd, we say that y trumps x and write x ~<t V if there exists z G P<oo 
such that X <® z ^ y ® z. It is important to require that the auxihary vector z (called the catalyst) is 
finitely supported (see Remark [2]). Given y £ Pa, 'we write Td(y) for the set of d-dimensional vectors 
trumped by y, that is 

Td(y) ^ {x <^ Pd s.t. X <T y}- 

The set Td{y) is in general larger than Sd{y) [4J and much more complicated to describe. Up to now, 
there is no known simple procedure to decide whether x £ Td(y) or not. Hence, finding a tractable 
characterization of the relation -<t (or, equivalently, of the set Td{y)) has become an important open 
problem in quantum information theory [16]. The geometry of Td{y) has been studied in [3l|4]: it is a 
bounded convex set that it is neither closed nor open (provided y is not too simple) . We shall introduce 
now another important extension of LOCC transformations. Bandyopadhyay et al [l] found an example 
of entangled states and with the property that the LOCC transformation \(j>) is impossible 

but, when one tries to transform multiple copies of the states, the transformation \tp)'^"' 
becomes possible. We say that MLOCC-dominates \4>). We introduce the analogue of the trumping 
relation for probability vectors: 

X'<My iff 3n ^ Is.t.x®" 2/®", 
and the set of probability vectors MLOCC-dominated by a given vector y: 

Mdiv) = {x e Pd s.t. X -^M y}- 

Not much is known about the set Md(y). It has been studied in i6j and shown to have many similarities 
with the set Td{y): for example it is neither closed nor open in general. One important point is that, 
for all y, we have Md{y) C Td{y) (see [6]). 

We report progress towards a description of the sets of Md{y) and Td{y). The main ingredient of 
our approach is the following observation. Consider two vectors x,y & Pd- Whether x ~< y, x ~<m y, 
X -<T y or not depends only on the non-zero coordinates of x and y. Thus, it is possible to <I<mI -<t- 
compare vectors of different sizes by appending the necessary amount of zero coordinates to the end 
of one of them. Hence, it seems more natural (at least from a mathematical point of view) to consider 
the sets 

T<oo{y) = {x e P<oo s.t. X y} ^ {x e P<oo s.t. 3z e P<oo s.t. x(^z^y®z}^ [J Td'iy) 
and 

M<oo(y) = {xe P<oo s.t. X <M y} = {x€ P<^ s.t. 3n ^ 1 s.t. x®" -< 2/®"} = U Md'iy). 

d'^d 

The important point here is that both T^oo{y) and M<oo(2/) do not depend anymore on the size 
of y, but only on the non-null coordinates of y. Of course, if y G Pd, Td{y) = T<oo(y) n Pd and 
Md{y) = Af<oo(2/) n Pd] this shows that the sets T<oo(y) and M^ao{y) are not closed either in general 
(otherwise Td{y) and Md{y) would be also closed). We then write T<oo(y) and M^aoiy) to denote the 
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closure taken with respect to the £i-norm, the natural topology in this setting (see Remark [S]). Recall 
that for p ^ 1, the £p norm of a vector x G Pd is defined as 

(1) ikiip=(E< 

\i=l 

and \\x\\^ = maxxi- We now come to our main result: 
Theorem 1. Consider two vectors x,y ^ P<oo- The following assertions are equivalent: 

(a) X e M<oo(y), 

(b) X e T^ooiy), 

(c) \\x\\p^\\y\\pVp;?l. 

Remark 1. Note that instead of demanding that \\x\\p ^ ||y||p for allp^ 1, it suffices to ask for x,y G 
Pd that the inequality holds for all p £ [l,Pmaxix,y)], where Pmax{x,y) = logd/ {logymax - logx^ax)- 
The inequalities forp > Pmax follow by simple computation. For such results in a more general setting, 
see [I2]. 

Remark 2. It is important to see at this point how the set T^oo{y) is related to the set Td{y). First 
of all, note that if we drop the closure, we have equality: T^oo{y) H = Td{y) for y € Pd- However, 
when taking the £i closure of the left hand side, we obtain a strict inclusion: Td{y) C T^oo[y)^Pd- An 
example for such a vector is provided by the phenomenon of infinite- dimensional catalysis, discovered 
by Daftuar gj. Take y = (0.5,0.25,0.25) and x = (0.4,0.4,0.2). It is obvious that x ^ Td{y) because 
2^3 < 2/3 and the condition Xd ^ yd is necessary for x G Td{y). However, there exist an infinite- 
dimensional catalyst z = (1 — a)(l,Q!, a^, . . . ^a^ , . . .), where a — , such that x ® z < y ® z and 
\\x ® z\\p ^ \\y ® z\\p for all p ^ 1. Note that z is Ip-bounded and thus \\x\\p ^ ||y||p for all p 1. By 
the preceding theorem, we have that x G T<oo(2/) H P3. For further remarks on this topic, see Section 

m 

Remark 3. The use of the £1 norm is natural in this context from a mathematical point of view since 
P<oo is a subset of the norm-closed hyperplane of £1 defined by J^^i — 1- ""-s explain also how 
it relates to other physically motivated distances between the approaching states \4>n) and the original 
state 10). Recall that x is the eigenvalue vector of the reduced density matrix corresponding to Alice's 
(or, equivalently to Bob's) part of the system. From the details of the proof (see also Section\^, one 
sees that the size of the approaching vectors Xn increases with n. So, in order to compare p and p„, we 
have to realize them as density matrices on the same Hilbert space Ti. Moreover, we can suppose that 
the two states are diagonalizable in the same basis (Alice can achieve this by applying a local unitary 
basis change). As usually, we append the necessary number of zero eigenvalues to x in order to have 
the same size as Xn. We obtain the following equality: 

\\X - Xn\\i = \\p - PnWtr- 

So, for Alice's part of the system, we obtain a convergence in the trace norm sense. It is well known 
that the trace norm distance is related to the probability that the two states can be distinguished by 
some measurement. Moreover, by using some classical inequalities (see [15], Chapter 9), the fidelity 
F{p,pn) can be shown to converge to 1. 

3. A £p VERSION OF Ky Fan theorem 

In this section, we explain how Theorem[T]can be seen as an analogue of Ky Fan dominance theorem. 
We refer to [2] for background. We denote by the space of complex x matrices. A norm ||| • ||| 
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on is said to be unitarily invariant if |||t/^y||| = |||A||| for all unitary matrices U, V. A norm || - 1| 
on R'' is said to be symmetric if 



for all choices of signs in {±1}'' and all permutations a G &d- It is well-known ([2j, Theorem IV. 2.1) 
that unitarily invariant norms on are in 1-to-l correspondance with symmetric norms on R'^ 
(consider the restriction of 1 1 1 ■ 1 1 1 to diagonal matrices) . 

Examples of unitarily invariant norms are given by Ky Fan norms, defined for A; = l,2,...,c?by 



where si{A) ^ • • ■ ^ Sd{A) denote the ordered singular values of a matrix A. The Ky Fan dominance 
theorem asserts that these norms are extremal among unitarily invariant norms in the following sense: 
if A,B satisfy |||^|||(fc) ^ for any fc = 1, • • • ,d, then |||A||| ^ \\\B\\\ for any unitarily invariant 

norm ; this condition can also be formulated as s{A) s{B), where s(-) denotes the vector of singular 
values of a matrix. 

This gives a way to derive an infinite family of inequalities from a finite one. However this may be 
a too strong requirement and one can wonder what happens for an important special class of unitarily 
invariant norms: the Schatten p-norms (or noncommutative £p norms), defined for p ^ 1 by 



To state our result, we need to compare matrices of different sizes. If d < d' we identify ^d with the 
top- left corner of ^d' ', this gives a natural inclusion ^d C ^d' and we write ^<oo = [Jd-^d- Note 
that the tensor product of matrices is a well-defined operation on ^<oo- 

Theorem 2. Let A, B G ^d- The following are equivalent 

(1) IPIIIpS^ \\\B\\\p for allp^l. 

(2) There exists m ^<oo a sequence (An) so i/iai lim„^oo |||A„-^|||i = and |||^®"||| ^ 
for all unitarily invariant norms |||.||| (or, equivalently, so that s(A^") s{B®^)). 

Of course, a main difference between this result and Ky Fan dominance theorem is that condition 
(ii) here is hard to check and involves infinitely many inequalities. 

Proof (sketch). Because of the bijective correspondance between unitarily invariant norms on matrices 
and symmetric norms on vectors, it is enough to prove the theorem for positive diagonal matrices. 
This is almost the content of the equivalence (a) <J=4> (c) of Theorem 1. The only sHght remark that 
we need in order to get condition (2) as stated here is the following: in Lemma[2]below, it follows from 
the proof that we can actually choose the integer n so that x"^^ <w y®^ for any N ^ n. □ 



We shall prove the sequence of implications (a) => (b) (c) ^ (a). The first two are well known; we 
sketch their proof for completeness. The third is the most difficult one and represents our contribution 
to the theorem. 

(a) => (b) Because the closure is taken with respect to the same topology (ii) for both M^^{y) 
and T<oo(y), it is enough to show M<oo(y) C T^oo{y). Let x G M<oo(y) and consider n such that 



||(xi, . . . ,a;d)|| = ||(±x^(i),...,±x^(d))|| 



fc 




4. The proof of the theorem 
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x®" ^ trick here (see [6]) is to use the following z as a catalyst 

z = a;®("-i) © a.®(«-2) <^ y © . . . © x ® © y®(«-i). 

For simplicity we do not normalize z, but this is irrelevant. The vector z has been constructed such 
that 

x z = a;®" © w and y®z = y®" © w, 

where w is the same in both expressions. This implies that x ® z ^ y ® i.e. x € r<oo(y)- 

(b) =J> (c) Let z e P<oo be the catalyst for x <t U- x ® z ~< y (g) z. A function (p : R'' ^ R is said 
to be Schur-convex if a ^ 5 implies (p{a) ^ ^pib). It is well-known (see [HI [l^) that if ft. : R — > R is a 
convex function, then p : x i-^ SiLi h{xi) is Schur-convex. Consequently, the functions x \\x\\^ are 
Schur-convex for p ^ 1. Moreover, they satisfy the identity \\x ® z\\p = ||a;||p||z||p, and similarly for y. 
Since ||z||p is finite, we get that ||a;[|p ^ \\y\\p- To show that the same is true for x £ r<oo(?/), it suffices 
to check that the set of a; G such that ||a;||p < ||y||p is norm-closed; this follows from the inequality 
II ■ Hp < II -111- 

(c) (a) We will adapt some techniques used by G. Kuperberg in a slightly different context 
|10) . In our proof, we allow deficient vectors, i.e. vectors with total mass smaller than 1, and we use 
submajorization. 

As in [lOj, we associate to a positive vector x S R'* the measure fix = ^'i=i ^i^iogxi, where Sz 
is the Dirac measure at point z. The basic property is that the tensor product operation of vectors 
corresponds to the convolution of associated measures: 

The convolution of two measures ^ and u is defined by the relation 

/i * v{A) = (^ X I/) ({(x, ?/) e R^ : X + y G A]) . 

Moreover, if ^ and v are probability measures and and denote independent random variables 
with laws respectively ^ and then /i * is the law of X^ + X^- 

The following lemma gives a way to prove majorization using comparison of the tails of the associated 
measures 

Lemma 1. Let x and y he two vectors oflV^ with non-negative components. Consider the measures 
and /iy associated with x and y. Assume that, for all t G R, fix[t, oo) ^ fiy[t, oo). Then x y. 

Proof. Note that 

flx[t, 00)= ^ Xi= ^ Xi. 

i:\ogXi^t i:Xi^c->Lp(t) 

Thus, for all m > 0, ^i-x^^u^i ^ J^tyi^uVi- simplicity, we assume first that all coordinates of y 
are distinct. We will show by induction on fc G {1, . . . , d} that X^iLi ^ Si=i Vi ■ For the first step; 
use u — y[ to conclude that x[ ^ yj. Now, fix fc G {1, . . . , d— 1} and suppose that ^ J2^=i Vi- 

If ^k+i ^ J/fc+i) the induction step is obvious. If x\^^^ > y^^i, we use u — xj.^-^ to get 

k+l k+1 

»=i i:x,-^xl^^ i-y,>xl^^ i--yi^yi+i 

This completes the induction when y has distinct coordinates. The general case follows by approximat- 
ing y by y + £„, where (£„) is a suitable sequence of positive vectors tending to 0. The approximation 
is possible since the set of vectors y majorizing a fixed x is closed. □ 
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We now get to the key lemma in our argument. We shall use a slightly modified version of Cramer 
large deviations theorem — see Appendix. 

Lemma 2. Let x,y in R*^, with nonnegative coordinates. Assume that for any 1 ^ p ^ oo, we have 
the strict inequality \\x\\p < \\y\\p. Then there exists an integer n such that x®" y®". 

Proof. Consider x and y satisfying the hypotheses of the lemma. We can assume by multiplying both 
vectors by a positive constant K that ||y||i = 1. Let p = 1 — ||x||i > 0. We introduce the measures 
p,x and fly associated to x and y; p,y is a. probability measure but fix is not, so we add a mass at — oo 
by setting Jl^ = fix + p5 -oo. Let X and Y be random variables distributed according to Jlx and fj,y 
respectively. We denote by (X„) (resp. (Yn)) a sequence of i.i.d. copies of X (resp. Y). We are going 
to show that for n large enough 

(2) Vi e R, P{Xi + ■ ■ ■ + Xn nt) P(Yi + • • • + r„ ^ nt). 

This is equivalent to showing that 



dfiT = / d-pT ^ / dfil 

J nt J nt 

which, by the previous lemma implies a;®" y®". Note that the asymptotic behavior of the quantities 
appearing in ^ is governed by Cramer's theorem. Let /„(i) = P(^i + ■•■ + X„ ^ nty^" and 
(?„(i) — P{Yi + ■ ■ ■ + Yn ^ nt)^/". Applying Cramer's theorem (see Appendix), we obtain 



fit) := lim /„(i) = 



g{t) lim gn{t) 



1 - p if i < e(a:|a: ^ -oo) 

g-^xW otherwise. 

'i if t E(r) 

g-Ai^(t) otherwise. 



Note also that the log-Laplace of X, defined for A e R by Ax (A) — logEe , is related to the £p 
norms of x: 

VA>0, Ax(A) =log||a:||^+i. 

The same holds for Y: Ay(A) = log ||?;||^+} and thus we have Ax(A) < Ay(A) for A ^ 0. 

Let Mx — esssupX = log ||a;||cx; and My — esssupl" = log \\y\\oo ', by hypothesis Mx < My. First 
of all, note that fn{t) — for t ^ Mx, so it suffices to show that /„ ^ gn on (— oo, Mx], for n large 
enough. We claim that f < g on {—oo, My), and thus on {—co,Mx]. Indeed, for E(y) ^ t < My, 
the supremum in the definition of Ay (t) is attained at a point Aq ^ (cf Appendix) , so we have that 

fit) < e-(^oi - Ax(Ao)) < ^-(Aot - Ay(Ao)) ^ ^^^^^ 

where the strict inequality follows from the fact that Ax (A) < Ay (A), for all A ^ 0. For t < 
E(y), git) — 1 and fit) ^ 1 — p < 1. Moreover, the functions / and g admit finite limits in — oo: 
Hmt_>_oo /(i) = l—p and limt^_oog(i) — 1. Thus, on the compact set [— oo,Mx], the functions / 
and g are well-defined, non-increasing, continuous and satisfy f < g. 

We now use the following elementary fact: if a sequence of non-increasing functions defined on a 
compact interval / converges pointwise towards a continuous limit, then the convergence is actually 
uniform on / (for a proof see [17] Part 2, Problem 127; this statement is attributed to Polya or to 
Dini depending on authors). We apply this result to (/„) and (g^) on the interval / = [— oo,Mx] to 
conclude that the convergence is uniform for both sequences. As f < g, we can therefore find n large 
enough such that /„ ^ gn on /, and thus on R. This is equivalent to ^ and completes the proof of 
the lemma. □ 
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Remark 4. It is possible to avoid the use of Cramer's theorem by using low-technology estimates on 
large deviations probability instead, as done in [lO]. This requires additional care to get the required 
uniform bounds and slightly obfuscates the argument. The only advantage is to give explicit bounds for 
the value of n in Lemma [H which our compactness argument does not. These bounds are quite bad 
anyway, and for example do not allow to replace the £i-closure in the main theorem by a £p-closure for 
some p < I. 

Proof of (c) ^ (a) (continued) Recall that x and y are such that ^ ||y||p for any p ^ 1 and 
that we want to find, for any e > small enough, a vector G M^oo{y) such that ||a; — x^Hi < e. 
Let dx (resp. dy) be the number of nonzero coordinates of x (resp. y). We proceed as follows : let 
< e < 2dxXrnin and consider the (deficient) vector x'^ obtained from x by subtracting e/2dx to each 
of its nonzero coordinates. This implies that is a positive vector, ||a: — a;^||i = e/2 and that 
satisfies the hypotheses of Lemma [21 Applying the lemma, we obtain the existence of an integer n 
such that (x^)®" j/®". 

Remember that x'^ is deficient; we now enlarge it into a vector x^ G P<oo by adding mass e/2. But 
since we want to keep the property xf" -<w y®" (which is identical to xf" -<u, y®"), a safe way to 
do this is to add a large number of coordinates, each of them being very small. More precisely, let 
Xe = © 5®^ , where 5D = e/2 and (5 is a positive number such that 5{x'^)^l_ ^ min((x^)JJj;jj, yjjjjn). 
We claim that xf" -< y®", that is, for any k^l, 

k k 

(3) ^ E(y®")i- 

1=1 j=i 

Indeed, S has been chosen so that the c?" largest coordinates of xf" are exactly the coordinates of 
(x^)«^", so when 1 ^ k d!^, ^ follows from the relation (x^,)®" y®". If < fc ^ d]^, the 

inequality also holds since the choice of 5 guarantees (xf ")^ < (y®")i- Finally if A: ^ d'J, © holds 
trivially since the right-hand side equals 1. 

In conclusion, x^" -< y®", and thus x^ S M<oo(j/). But x^ has been constructed such that ||x — 
Xe\\i ^ £ and thus x G M<oo(j/) which completes the proof of the theorem. 

5. Conclusion and further remarks 

In conclusion, we are able to give a nice description of the ^i-closure of the set T<oo(y)- However, 
this closure may be substantially larger than the usual closure 1^(2/) in Pd, and requires approximation 
by vectors with growing support. Our result can be seen as a contribution to a conjecture attributed 
to Nielsen [3]: 

Conjecture 1. Fix a vector y E Pd- Then a vector x £ Pd belongs to Td{y) if and only if the following 
conditions are verified. 

(1) Forp^l, ||x||p^ ||y||p. 

(2) ForQ<p^l, \\x\\p ^ \\y\\p. 

(3) Forp<0, ||x||p^ ||y||p. 

M. KHmesh announced a proof of this conjecture in a short communication [9], but the solution 
has not appeared in print yet. However, his methods are different from our approach (private com- 
munication). Note that the definition of || • ||p given in ^ is extended to any p G R*. For p < 1, 
II • lip is not a norm in the usual sense. We have shown that the condition (1) above is equivalent 
to X G r<oo(2/)- Notice however that r<oo(y) is in general larger than Td{y); note also that the set 
of X G that satisfy conditions (1-3) is closed. The "only if part of the conjecture follows from 
standard convexity /concavity properties of functionals || • ||p, see |14[ [3]. 
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This question also appears in [6] where it is formulated using the Renyi entropies. For any real 
p ^ 1, the p- Renyi entropy is defined for a; e as 

i/,(.) = ^log, 

The limit case p — I corresponds to the usual entropy. The conditions (1-3) of the conjecture can be 
concisely reformulated as "i?p(x) ^ Hp{y) for all p". 

An intermediate notion is the following: for y ^ Pd, let T^ooiu) be the set of vectors x £ Pd such 
that there is a sequence {xn) in T^ooiv) tending to x, with a uniform bound on the size of the support 

of Xn- We think that a description of T<oo{y) could be related to the set of vectors which satisfy 
conditions (1) and (2) — but not necessarily (3) — in Conjecture [H 

There is one more consequence of our main theorem we would like to discuss. Recall that when 
defining catalysis, we insisted on the fact that the catalyst should be finitely-supported. Let Poo C £i 
be the set of infinite-dimensional probability vectors, and for y in P<oo, define the set T'{y) of (finitely 
supported) vectors trumped by y using infinite catalysts: 

T'{y) = {x e P<oo s.t. 3z G Poo s.t. a: (g) z ^ y (g) z}. 

As shown in j3j (Section 4.3), in general T<oo(y) ^ T'{y). However, since x € T'{y) implies ||a;||p < \\y\\p 
for all p ^ 1, it follows from our main theorem that T^oo{y) = T'{y). 

6. Appendix: On Cramer's theorem 

We review here some facts from large deviations theory. A complete reference for all the material 
contained here is [5]. Let X be a random variable taking values in [—00,00). We allow X to equal 
—00 with positive probability; this is a nonstandard hypothesis. We however exclude the trivial case 
'P{X = —00) = 1. We write E for the expectation. We assume also that the conditional expectation 
E(Ar|X / —00) is finite. The cumulant generating function Ax of the random variable X is defined 
for any A G R by 

Ax(A) =logEe^^. 

It is a convex function taking values in (—00, +00]. Its convex conjugate A^^, sometimes called the 
Cramer transform, is defined as 

(4) h*x{x) ^ \x ~ Kx{X). 

Note that Kx is a smooth and strictly convex function on [0;+oo]. Moreover, A^(0) = E(X|Ar ^ 
—00) and limA^+00 A^(A) = esssup(Ar). Consequently, for any x such that E(A"|Ar ^ —00) < a; < 
esssup(A'), the supremum in (01) is attained at a unique point A ^ 0. We now state Cramer's theorem 
in a suitable formulation 

Proposition 1. Let X he a [—00, +oo)-valued random variable such that Ax(A) < +00 for any A ^ 0. 
Let (Xi) be a sequence of i.i.d. copies of X . Then for any t G R 

lim i logP(X, + . . . + X„ 5. tn) ^ ^I'^SP^^ ^ ' ^ ^(^1^ ^ 



-Ax{t) otherwise. 



Proof. Let X denote the random variable X conditioned to be finite, that is for any Borel set P C R 

P{x eb) = -^p(a: g p), 
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where p = 'P{X = — cx)). A consequence of the classical Cramer theorem ([5], Corollary 2.2.19) states 
that 

(5) Vt e R, lim - logP(Xi + • ■ • + 1„ ^ in) = - inf k\{s). 

One checks that = Ax — log(l — p), and consequently 

(6) A^-Al+log(l-p). 
Note also that 

(7) P{Xi + --- + Xn>tn)^{l- p)"P(Xi + ■ • • + 1„ ^ tn). 

Finally, note that the infimum on the right hand side of ([5]) is null for t ^ E(A') and equals A*j.(i) 
for t > E(X). This follows from the fact that the convex function t ^ A*,(t) attains its zero minimum 
at t = E(X) and is increasing for t > E(A'). Thus, we can rewrite equation ^ as: 

(8) lim i logP(l, + . . . + 1„ ^ in) = 1° . 
n^cc n I — A^(i) otherwise. 

The proposition follows from the equations ^ and □ 
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